Abstract

Recently, Byrka et al. [1] gave a 1.39-approximation for the Steiner tree problem,
using a hypergraph-based LP relaxation. They also upper-bounded its integrality gap 
by 1.55. We describe a shorter proof of the same integrality gap bound, by applying 
some of their techniques to a randomized loss-contracting algorithm. 



1 Introduction 

In the Steiner tree problem, we are given an undirected graph G = (V, E) with costs c on 
edges and its vertex set partitioned into terminals (denoted $R \subseteq V$) and Steiner vertices
($V \setminus R$). A Steiner tree is a tree spanning all of R plus any subset of $V \setminus R$, and the problem
is to find a minimum-cost such tree. The Steiner tree problem is APX-hard, thus the best 
we can hope for is a constant-factor approximation algorithm. 

The best known ratio is a result of Byrka, Grandoni, Rothvoß and Sanità [1]: their
randomized iterated rounding algorithm gives approximation ratio $\ln(4) + \epsilon \approx 1.39$. The
prior best was a $1 + \frac{\ln 3}{2} + \epsilon \approx 1.55$ ratio, via the deterministic loss-contracting algorithm of
Robins and Zelikovsky [6]. The algorithm of [1] differs from previous work in that it uses 
a linear programming (LP) relaxation; the LP is based on hypergraphs, and it has several 
different-looking but equivalent [2, 5] nice formulations. A second result of [1] concerns the 
LP's integrality gap, which is defined as the worst-case ratio (max over all instances) of the 
optimal Steiner tree cost to the LP's optimal value. Byrka et al. show the integrality gap 
is at most 1.55, and their proof builds on the analysis of [6]. In this note we give a shorter 
proof of the same bound using a simple LP-rounding algorithm. 

Figure 1: In (i) we show a Steiner tree; circles are terminals and squares are Steiner nodes.
In (ii) we show its decomposition into full components, and their losses in bold. In (iii) we
show the full components after loss contraction.

We now describe one formulation for the hypergraphic LP. Given a set $K \subseteq R$ of
terminals, a full component on K is a tree whose leaf set is K and whose internal nodes
are Steiner vertices. Every Steiner tree decomposes in a unique edge-disjoint way into
full components; Figure 1(i) shows an example. Moreover, one can show that a set of
full components on sets $(K_1, \dots, K_r)$ forms a Steiner tree if and only if the hypergraph
$(V, (K_1, \dots, K_r))$ is a hyper-spanning tree. Let F(K) denote a minimum-cost full component
for terminal set $K \subseteq R$, and let $C_K$ be its cost. The hypergraphic LP is as follows:

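$$
\begin{aligned}
\min\ & \sum_{K} C_K x_K \\
\forall\, \emptyset \neq S \subseteq R:\ & \sum_{K :\, K \cap S \neq \emptyset} x_K \,(|K \cap S| - 1) \le |S| - 1 \\
& \sum_{K} x_K \,(|K| - 1) = |R| - 1 \\
\forall K:\ & x_K \ge 0
\end{aligned}
\tag{S}
$$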
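On small instances the constraints of the hypergraphic LP can be checked by brute force over all nonempty terminal subsets. The following is a minimal sketch, not part of the original note: the function name `is_feasible`, the dictionary encoding of x (terminal set K mapped to its value), and the tolerance are assumptions made for illustration, and the enumeration is exponential in the number of terminals.

```python
from itertools import combinations


def is_feasible(R, x, tol=1e-9):
    """Check feasibility for the hypergraphic LP by enumeration.

    R: set of terminals.
    x: dict mapping frozenset K (a terminal set) to its value x_K.
    """
    # Nonnegativity: x_K >= 0 for every K.
    if any(value < -tol for value in x.values()):
        return False
    # Equality constraint: sum_K x_K (|K| - 1) = |R| - 1.
    total = sum(value * (len(K) - 1) for K, value in x.items())
    if abs(total - (len(R) - 1)) > tol:
        return False
    # Subset constraints, one for every nonempty S contained in R.
    terminals = sorted(R)
    for size in range(1, len(terminals) + 1):
        for subset in combinations(terminals, size):
            S = set(subset)
            lhs = sum(value * (len(K & S) - 1)
                      for K, value in x.items() if K & S)
            if lhs > len(S) - 1 + tol:
                return False
    return True
```

For example, on three terminals a, b, c the integral solution that picks only the full component on {a, b, c} passes the check, while putting value 2 on the component {a, b} violates the constraint for S = {a, b}.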

The integral solutions of (S) correspond to the full component sets of Steiner trees. As
an aside, the r-restricted full component method (e.g. [4]) allows us to assume there are a
polynomial number of full components while affecting the optimal Steiner tree cost by a 
1 + e factor. Then, it is possible to solve (S) in polynomial time [1, 8]. Here is our goal: 

Theorem 1. [1] The integrality gap of the hypergraphic LP (S) is at most $1 + \frac{\ln 3}{2} \approx 1.55$.



2 Randomized Loss-Contracting Algorithm 

In this section we describe the algorithm. We introduce some terminology first. The loss 
of full component F(K), denoted by Loss(K), is a minimum-cost subset of F(K)'s edges
that connects the Steiner vertices to the terminals. For example, Figure 1(ii) shows the
loss of the two full components in bold. We let loss(K) denote the total cost of all edges
in Loss(K). The loss-contracted full component of K, denoted by LC(K), is obtained from
F(K) by contracting its loss edges (see Figure 1(iii) for an example).

For clarity we make two observations. First, for each K the edges of LC(K) correspond
to the edges of $F(K) \setminus \mathrm{Loss}(K)$. Second, for terminals u, v, there may be a uv edge in several
LC(K)'s but we think of them as distinct parallel edges.

Our randomized rounding algorithm, RLC, is shown below. We choose M to have value
at least $\sum_K x_K$ such that $t = M \ln 3$ is integral. MST(·) denotes a minimum spanning tree
and mst its cost.

Figure 2: In (i) we show a terminal spanning tree T in red, and a full component spanning
terminal set $K \subseteq \{a,b,c,d\}$ in black; thick edges are its loss. In (ii) we show T/K, and
$\mathrm{Drop}_T(K)$ is shown as dashed edges. In (iii) we show $\mathrm{MST}(T \cup \mathrm{LC}(K))$.



Algorithm RLC.

1: Let $T_1$ be a minimum spanning tree of the induced graph G[R].
2: $x \leftarrow$ Solve (S)
3: for $1 \le i \le t$ do
4:     Sample $K_i$ from the distribution^a with probability $x_K / M$ for each full component K.
5:     $T_{i+1} \leftarrow \mathrm{MST}(T_i \cup \mathrm{LC}(K_i))$
6: end for
7: Output any Steiner tree in $\mathrm{ALG} := T_{t+1} \cup \bigcup_{i=1}^{t} \mathrm{Loss}(K_i)$.

^a $K_i \leftarrow \emptyset$ with probability $1 - \sum_K x_K / M$.

To prove that ALG actually contains a Steiner tree, we must show all terminals are 
connected. To see this, note each edge uv of $T_{t+1}$ is either a terminal-terminal edge of G[R]
in the input instance, or else $uv \in \mathrm{LC}(K_i)$ for some i and therefore a u-v path is created
when we add in $\mathrm{Loss}(K_i)$.
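The choice of M and the sampling in step 4 of RLC can be sketched as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the names `choose_M_and_t` and `sample_component`, the dictionary encoding of x, and the convention of returning `None` for the empty component are all hypothetical. Picking the smallest admissible t is one valid choice; the text only requires that M is at least the sum of the x-values and that t = M ln 3 is integral.

```python
import math
import random


def choose_M_and_t(x):
    """Return (M, t) with M >= sum_K x_K and t = M * ln 3 an integer."""
    total = sum(x.values())
    # Round total * ln 3 up to an integer t, then set M = t / ln 3 >= total.
    t = max(1, math.ceil(total * math.log(3)))
    return t / math.log(3), t


def sample_component(x, M, rng=random):
    """Draw K with probability x_K / M, or None (empty) otherwise."""
    r = rng.random() * M  # uniform on [0, M)
    acc = 0.0
    for K, x_K in x.items():
        acc += x_K
        if r < acc:
            return K
    # Remaining probability mass 1 - sum_K x_K / M: the empty component.
    return None
```

A call such as `M, t = choose_M_and_t(x)` followed by t draws of `sample_component(x, M)` reproduces the sampling pattern of the main loop; the spanning tree updates of step 5 are not covered by this sketch.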

3 Analysis 

In this section we prove that the tree's expected cost is at most $1 + \frac{\ln 3}{2}$ times the optimum value
of (S). Each iteration of the main loop of algorithm RLC first samples a full component
$K_i$ in step 4, and subsequently recomputes a minimum-cost spanning tree in the graph
obtained from adding the loss-contracted part of $K_i$ to $T_i$. The new spanning tree $T_{i+1}$ is
no more expensive than $T_i$; some of its edges are replaced by newly added edges in $\mathrm{LC}(K_i)$.
Bounding the drop in cost will be the centerpiece of our analysis, and this step will in turn 
be facilitated by the elegant Bridge Lemma of Byrka et al. [1]. We describe this lemma first. 

We first define the drop of a full component K with respect to a terminal spanning tree 
T (it is just a different name for the bridges of [1]). Let T/K be the graph obtained from 
T by identifying the terminals spanned by K. Then let 

$$\mathrm{Drop}_T(K) := E(T) \setminus E(\mathrm{MST}(T/K)),$$

be the set of edges of T that are not contained in a minimum spanning tree of T/K, and
$\mathrm{drop}_T(K)$ be its cost. We illustrate this in Figure 2. We state the Bridge Lemma here and
present its proof for completeness.
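The definition of the drop can be made concrete with a short sketch. It assumes T is given as a list of (u, v, cost) edges forming a spanning tree on the terminals; the function names `drop` and `drop_cost` are hypothetical. Contracting K is simulated by merging its terminals in a union-find structure, after which Kruskal's algorithm on the edges of T yields a minimum spanning tree of T/K, and the rejected edges form the drop. With ties in the costs, the set returned corresponds to one particular minimum spanning tree of T/K.

```python
def drop(tree_edges, K):
    """Edges of T that are not in a minimum spanning tree of T/K.

    tree_edges: list of (u, v, cost) forming a spanning tree T.
    K: iterable of terminals to be identified (contracted).
    """
    parent = {}

    def find(a):
        parent.setdefault(a, a)
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    # Identify the terminals of K, which is what forming T/K does.
    members = list(K)
    for v in members[1:]:
        parent[find(v)] = find(members[0])

    # Kruskal on the edges of T: an edge closing a cycle is dropped.
    dropped = []
    for u, v, cost in sorted(tree_edges, key=lambda e: e[2]):
        ru, rv = find(u), find(v)
        if ru == rv:
            dropped.append((u, v, cost))
        else:
            parent[ru] = rv
    return dropped


def drop_cost(tree_edges, K):
    """Total cost of the dropped edges."""
    return sum(cost for _, _, cost in drop(tree_edges, K))
```

On the path a-b-c-d with costs 1, 5, 2, contracting K = {a, c} closes the cycle through a, b, c, so the most expensive edge on it, bc, is the one dropped.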

Lemma 1 (Bridge Lemma [1]). Given a terminal spanning tree T and a feasible solution
x to (S),

$$\sum_K x_K \,\mathrm{drop}_T(K) \ge c(T). \tag{1}$$

Proof. The proof needs the following theorem of Edmonds [3]: given a graph H = (R, F),
the extreme points of the polytope

$$\Big\{ z \in \mathbb{R}^F_{\ge 0} \;:\; \sum_{e = (u,v) \in F :\, u \in S,\, v \in S} z_e \le |S| - 1 \ \ \forall\, \emptyset \neq S \subseteq R, \quad \sum_{e \in F} z_e = |R| - 1 \Big\} \tag{G}$$

are the indicator variables of spanning trees of H. The proof strategy is as follows. We
construct a multigraph H = (R, F) with costs c, and $z \in \mathbb{R}^F_{\ge 0}$ such that: the cost of z equals
the left-hand side of (1); $z \in$ (G); and all spanning trees of H have cost at least c(T).
Edmonds' theorem then immediately implies the lemma, since z is a convex combination of
spanning trees of H. In the rest of the proof we define
H and supply the three parts of this strategy.

For each full component K with $x_K > 0$, consider the edges in $\mathrm{Drop}_T(K)$. Contracting
all edges of $E(T) \setminus \mathrm{Drop}_T(K)$, we see that $\mathrm{Drop}_T(K)$ corresponds to edges of a spanning tree
of K. These edges are copied (with the same cost c) into the set F, and the copies are given
weight $z_e = x_K$. Using the definition of drop, one can show each $e \in F$ is a maximum-cost
edge in the unique cycle of $T \cup \{e\}$.

Having now defined F, we see

$$\sum_{e \in F} c_e z_e = \sum_K x_K \,\mathrm{drop}_T(K). \tag{2}$$

Note that we introduce $|K| - 1$ edges for each full component K, and that, for any $S \subseteq R$,
at most $|S \cap K| - 1$ of these have both ends in S. These two observations together with the
fact that x is feasible for (S) directly imply that z is feasible for (G).

To show all spanning trees of H have cost at least c(T), it suffices to show T is an MST
of $T \cup H$. In turn, this follows (e.g. [7, Theorem 50.9]) from the fact that each $e \in F$ is a
maximum-cost edge in the unique cycle of $T \cup \{e\}$. □

We also need two standard facts that we summarize in the following lemma. They 
rely on the input costs satisfying the triangle inequality, and that internal nodes of full 
components have degree at least 3, both of which hold without loss of generality. 

Lemma 2. (a) The value mst(G[R]) of the initial terminal spanning tree computed by
algorithm RLC is at most twice the optimal value of (S). (b) For any full component K,
$\mathrm{loss}(K) \le C_K / 2$.

Proof. See Lemma 4.1 in [4] for a proof of (b). For (a) we use a shortcutting argument
along with Edmonds' polytope (G) for the graph H = G[R]. In detail, let x be an optimal
solution to (S). For each K, shortcut a tour of F(K) to obtain a spanning tree of K with
c-cost at most twice $C_K$ (by the triangle inequality) and add these edges to F with z-value
$x_K$. Like before, since x is feasible for (S), z is feasible for (G), and so there is a spanning
tree of G[R] whose c-cost is at most $\sum_{e \in F} c_e z_e \le 2 \sum_K C_K x_K$. □

We are ready to prove the main theorem. 

Proof of Theorem 1. Let x be an optimal solution to (S) computed in step 2, define $\mathrm{lp}^*$
to be its objective value, and

$$\mathrm{loss}^* = \sum_K x_K \,\mathrm{loss}(K)$$

its fractional loss. Our goal will be to derive upper bounds on the expected cost of tree $T_i$
maintained by the algorithm at the beginning of iteration i. After selecting $K_i$, one possible
candidate spanning tree of $T_i \cup \mathrm{LC}(K_i)$ is given by the edges of $(T_i \setminus \mathrm{Drop}_{T_i}(K_i)) \cup \mathrm{LC}(K_i)$,
and thus

$$c(T_{i+1}) \le c(T_i) - \mathrm{drop}_{T_i}(K_i) + c(\mathrm{LC}(K_i)). \tag{3}$$

Let us bound the expected value of $c(T_{i+1})$, given any fixed $T_i$. Due to the distribution
from which $K_i$ is drawn, and using (3) with linearity of expectation, we have

$$E[c(T_{i+1})] \le c(T_i) - \frac{1}{M} \sum_K x_K \,\mathrm{drop}_{T_i}(K) + \frac{1}{M} \sum_K x_K \,(C_K - \mathrm{loss}(K)).$$

K K 

Applying the Bridge Lemma on the terminal spanning tree $T_i$, and using the definitions of
$\mathrm{lp}^*$ and $\mathrm{loss}^*$, we have

$$E[c(T_{i+1})] \le \Big(1 - \frac{1}{M}\Big) E[c(T_i)] + \frac{\mathrm{lp}^* - \mathrm{loss}^*}{M}.$$

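By induction this gives

$$
\begin{aligned}
E[c(T_{t+1})] &\le \Big(1 - \frac{1}{M}\Big)^{t} c(T_1) + (\mathrm{lp}^* - \mathrm{loss}^*)\Big(1 - \Big(1 - \frac{1}{M}\Big)^{t}\Big) \\
&\le \mathrm{lp}^* \Big(1 + \Big(1 - \frac{1}{M}\Big)^{t}\Big) - \mathrm{loss}^* \Big(1 - \Big(1 - \frac{1}{M}\Big)^{t}\Big),
\end{aligned}
$$

where the second inequality uses Lemma 2(a). The cost of the final Steiner tree satisfies
$c(\mathrm{ALG}) \le c(T_{t+1}) + \sum_{i=1}^{t} \mathrm{loss}(K_i)$. Moreover, since each sampled component has
expected loss $\mathrm{loss}^*/M$,

$$
\begin{aligned}
E[c(\mathrm{ALG})] &\le E[c(T_{t+1})] + t \cdot \mathrm{loss}^*/M \\
&\le \mathrm{lp}^* \Big(1 + \Big(1 - \frac{1}{M}\Big)^{t}\Big) + \mathrm{loss}^* \Big(\Big(1 - \frac{1}{M}\Big)^{t} + \frac{t}{M} - 1\Big) \\
&\le \mathrm{lp}^* \Big(\frac{1}{2} + \frac{3}{2}\Big(1 - \frac{1}{M}\Big)^{t} + \frac{t}{2M}\Big) \\
&\le \mathrm{lp}^* \Big(\frac{1}{2} + \frac{3}{2} \exp(-t/M) + \frac{t}{2M}\Big),
\end{aligned}
$$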

where the third inequality uses (a weighted average of) Lemma 2(b). The last line explains
our choice of $t = M \ln 3$, since $\lambda = \ln 3$ minimizes $\frac{1}{2} + \frac{3}{2} e^{-\lambda} + \frac{\lambda}{2}$, with value $1 + \frac{\ln 3}{2}$. Thus
the algorithm outputs a Steiner tree of expected cost at most $(1 + \frac{\ln 3}{2})\,\mathrm{lp}^*$, which implies
the claimed upper bound of $1 + \frac{\ln 3}{2}$ on the integrality gap. □
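For completeness, the minimization behind the choice of t can be spelled out by elementary calculus. The function name f below is introduced only for this derivation; λ stands for t/M.

```latex
\begin{align*}
f(\lambda) &= \tfrac{1}{2} + \tfrac{3}{2} e^{-\lambda} + \tfrac{\lambda}{2}, \\
f'(\lambda) &= -\tfrac{3}{2} e^{-\lambda} + \tfrac{1}{2} = 0
  \iff e^{-\lambda} = \tfrac{1}{3}
  \iff \lambda = \ln 3, \\
f''(\lambda) &= \tfrac{3}{2} e^{-\lambda} > 0
  \quad \text{(so the critical point is the unique minimum)}, \\
f(\ln 3) &= \tfrac{1}{2} + \tfrac{3}{2} \cdot \tfrac{1}{3} + \tfrac{\ln 3}{2}
  = 1 + \tfrac{\ln 3}{2} \approx 1.55 .
\end{align*}
```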






We now discuss a variant of the result just proven. A Steiner tree instance is quasi-bipartite
if there are no Steiner-Steiner edges. For quasi-bipartite instances, Robins and
Zelikovsky tightened the analysis of their algorithm to show it has approximation ratio $\alpha$,
where $\alpha \approx 1.28$ satisfies $\alpha = 1 + \exp(-\alpha)$. Here, we'll show an integrality gap bound of $\alpha$
(the longer proof of [1] via the Robins-Zelikovsky algorithm can be similarly adapted). We
can refine Lemma 2(a) (like in [6]) to show that in quasi-bipartite instances,
$\mathrm{mst}(G[R]) \le 2(\mathrm{lp}^* - \mathrm{loss}^*)$. Continuing along the previous lines, we obtain

$$E[c(\mathrm{ALG})] \le \mathrm{lp}^* \,(1 + \exp(-t/M)) + \mathrm{loss}^* \,(t/M - 1 - \exp(-t/M))$$

and setting $t = \alpha M$ gives $E[c(\mathrm{ALG})] \le \alpha \cdot \mathrm{lp}^*$, as needed. We note that in quasi-bipartite
instances the hypergraphic relaxation is equivalent [2] to the so-called bidirected cut relaxation,
thus we get an $\alpha$ integrality gap bound there as well.
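The constant α is defined only implicitly, as the solution of α = 1 + exp(−α). A small numerical sketch can approximate it by fixed-point iteration; the function name `solve_alpha`, the starting point, and the tolerance are assumptions made for illustration. The iteration converges because the map α ↦ 1 + exp(−α) has derivative of absolute value below 1 for α ≥ 1.

```python
import math


def solve_alpha(tol=1e-12, max_iter=1000):
    """Approximate the solution of alpha = 1 + exp(-alpha)."""
    alpha = 1.0
    for _ in range(max_iter):
        nxt = 1.0 + math.exp(-alpha)
        if abs(nxt - alpha) < tol:
            return nxt
        alpha = nxt
    return alpha


alpha = solve_alpha()
# With t = alpha * M, the coefficient of loss* in the quasi-bipartite
# bound, t/M - 1 - exp(-t/M), is (numerically) zero.
loss_coefficient = alpha - 1.0 - math.exp(-alpha)
```

The vanishing of `loss_coefficient` is what makes t = αM the natural choice: the loss term disappears and the remaining factor 1 + exp(−α) equals α.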

At the risk of numerology, we conclude by remarking that $1 + \frac{\ln 3}{2}$ arose in two very
different ways, by analyzing different algorithms (and similarly for $\alpha \approx 1.28$). A simple
explanation for this phenomenon would be very interesting. 